Late Latin Charter Treebank: contents and annotation
نویسندگان
چکیده
This paper describes the construction and annotation of Late Latin Charter Treebank, a set three dependency treebanks (llct1, llct2 llct3) which together contain 1,261 Early Medieval documentary texts (i.e., original charters) written in Italy between ad 714 1000 (about 594,000 tokens). The focusses on matters linguistically or philologically inclined user llct needs to know: criteria charters were selected, special characteristics types utilised, geographical chronological distribution data. In addition normal queries forms, lemmas, morphology syntax, complex philological research settings are enabled by textual layer llct, indicates abbreviated damaged words, as well formulaic non-formulaic passages each charter.
منابع مشابه
Parallel Entity And Treebank Annotation
We describe a parallel annotation approach for PubMed abstracts. It includes both entity/relation annotation and a treebank containing syntactic structure, with a goal of mapping entities to constituents in the treebank. Crucial to this approach is a modification of the Penn Treebank guidelines and the characterization of entities as relation components, which allows the integration of the enti...
متن کاملITU Treebank Annotation Tool
In this paper, we present a treebank annotation tool developed for processing Turkish sentences. The tool consists of three different annotation stages; morphological analysis, morphological disambiguation and syntax analysis. Each of these stages are integrated with existing analyzers in order to guide human annotators. Our semiautomatic treebank annotation tool is currently used both for crea...
متن کاملAutomation of Treebank Annotation
Thorsten Brants and Wojciech Skut Universit at des Saarlandes Computational Linguistics D-66041 Saarbr ucken, Germany fbrants,[email protected] Abstract This paper describes applications of stochastic and symbolic NLP methods to treebank annotation. In particular we focus on (1) the automation of treebank annotation, (2) the comparison of con icting annotations for the same sentence and (3...
متن کاملThe Annotation Guidelines of the Latin Dependency Treebank and Index Thomisticus Treebank: the Treatment of some specific Syntactic Constructions in Latin
The paper describes the treatment of some specific syntactic constructions in two treebanks of Latin according to a common set of annotation guidelines. Both projects work within the theoretical framework of Dependency Grammar, which has been demonstrated to be an especially appropriate framework for the representation of languages with a moderately free word order, where the linear order of co...
متن کاملPorting an Ancient Greek and Latin Treebank
We have recently converted a dependency treebank, consisting of ancient Greek and Latin texts, from one annotation scheme to another that was independently designed. This paper makes two observations about this conversion process. First, we show that, despite significant surface differences between the two treebanks, a number of straightforward transformation rules yield a substantial level of ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Corpora
سال: 2021
ISSN: ['1755-1676', '1749-5032']
DOI: https://doi.org/10.3366/cor.2021.0217